CONVEX CHAINS IN Z 2 



NATHANAEL ENRIQUEZ 



Abstract: A detailed combinatorial analysis of lattice convex polygonal lines of N 2 joining to 
(n, n) is presented. We derive consequences on the line having the largest number of vertices as well 
as the cardinal and limit shape of lines having few vertices. The proof refines a statistical physical 
method used by Sinai to obtain the typical behavior of these lines, allied to some Fourier analysis. 
Limit shapes of convex lines joining to (n, n) and having a given total length are also characterized. 

1. Introduction 

In 1926, Jarmk found an equivalent of the maximal number of integral points that a 
portion of length n of a strictly convex function can interpolate. He obtained an explicit 
constant times n 2 / 3 . This work was at the origin of many works of Diophantine analysis, 
and we refer the reader to the papers of W. Schmidt [11] and Bombieri and Pila [6] for more 
recent results, discussions and open questions on this subject. 

One may slightly change Jarnik's framework, and consider the set of integral points which 
are interpolated by the graph on [0, n] of an increasing and strictly convex function satisfying 
/(0) = and f(n) = n. It turns out that this question is related to another family of works 
we shall discuss now. 

In 1979, Arnol'd [1] considered the question of the number of equivalence classes of convex 
lattice polygons having a given integer as area (we say that two polygons having their vertices 
on Z 2 are equivalent if one is the image of the other by an automorphism of Z 2 ). Later, 
Vershik changed the constraint in this question and raised the question of the number, and 
typical shape, of convex lattice polygons included in a large box [-N,N] 2 . In 1994, three 
different solutions to this problem were found by Barany [3], Vershik [13] and Sinai [12]. 
To be more precise, let II n be the set of convex polygonal lines T joining to (n, n), we 
shall sometimes refer at convex chains, such that the vertices of T are points of Z 2 and 
the angle between each side of T and the horizontal axis is in the interval [0, vr/2]. They 
prove, firstly, that the cardinal of LT n is of the order exp(n 2/ ' 3 (3(C(3)/C(2)) 1 ^ 3 + o(l)), when 
n goes to infinity. Secondly, they prove that the number of vertices constituting a typical 

n 2/3 

line is equivalent, when n goes to infinity, to ^^g-^^)) 1 / 3 ' ^* na ^y' they prove that the 

limit shape of a typical convex polygonal line is the arc of a parabola, which maximizes the 
affine perimeter. Later, Vershik and Zeitouni [15] proved, for a class of analogous problems, 
a large deviation principle involving the affine perimeter of the line. Finally, Barany and 
Prodromou, while considering the maximal number of vertices for a lattice polygon included 
in a convex set, proved recently in [6] the analog of Jarm'k's result, and they showed that 
the largest number of vertices for an element of LT n is equivalent to 3 (^) 2 ^ 3 . 

The nature of these results shows that this problem is related to both affine differential 
geometry and geometry of numbers. Indeed, the parabola found as limit shape coincides 
with the convex curve inside the square having the largest affine perimeter. Furthermore, 
the appearance of the values of the Riemann zeta function shows the arithmetic aspects 
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of the problem. One could show indeed that if the lattice I? was replaced by a Poisson 
Point Process having intensity one (which can be thought as the most isotropic "lattice" 
one can imagine), the constants (C 2 (3)C(2)) _1 /3 = 0, 749 and 3(C(3)/C(2)) 1 / 3 = 2.702 would 
be merely raised respectively to 1 and 3 in probability (cf. the concluding remarks at the 
end of the paper). 

In section 3, we precise the combinatorial aspect of the result of [3], [12], [13]. We obtain, 
for any positive number c, a logarithmic equivalent of the number of lines having [cn 2 / 3 ] 
vertices. This question is reminiscent of other ones considered, for instance, by Arratia 
and Tavare [2] or Vershik and Yakubovich [14] who were studying combinatorial objects 
(permutations, polynomials over finite field, Young tableaux...) having a specified num- 
ber of summands (according to the setting, we call summands, cycles, irreducible divisors 
etc.). In both cases, like in Sinai's approach, the method, borrowed from classical ideas of 
statistical physics, relies on the introduction of a grand canonical ensemble which endows 
the considered combinatorial object with a parametrized probability measure. Then, the 
strategy consists in calibrating the parameters of the probability in order to fit with the 
constraints one has to deal with. Concretely, in our question, it turns out that one can 
add one parameter in Sinai's probability distribution that makes it possible to take into 
account, not only the location of the extreme point of the chain but also the number of ver- 
tices it contains. The asymptotic of the number of lines having [cn 2//3 ] vertices can finally 
be expressed in terms of the polylogarithm functions LI2 and L13. 

In section 4, we deduce from this analysis the result of [6] about the largest number of 
vertices for an element of II n . We show how the proof of this result, using our analysis, 
suggests the more direct argument "a la Jarmk" presented in [6]. 

In section 5, we derive results about the cardinal and limit shape of lines having few 
vertices, answering a question of Vershik: it turns out that there is a limit shape for lines 
having for instance [y/n\ vertices which is, a little bit surprisingly, still the arc of parabola. 

In section 6, we go back to Jarmk's problem. In addition to Jarmk's result we recover, 
we give the asymptotic of the number of chains, typical number of vertices, and limit shape, 
which is an arc of a circle, in this different framework. 

Furthermore, one may mix both type of conditions and the statistical physical method 
still applies. In section 7, we obtain, for the convex lines joining to (n, n) and having a 
given total length, a continuous family of convex limit shapes which interpolates the diagonal 
of the square and the two sides of the square, going through the above arc of parabola and 
arc of circle. 

2. A ONE-TO-ONE CORRESPONDENCE 

We start this paper by reminding the correspondence between finite convex polygonal 
lines issuing from whose vertices define increasing sequences in both coordinates and finite 
subsets of the set of pairs of coprime positive integers each of them affected with a positive 
integer. 

More precisely, let II denote the set of finite planar convex polygonal lines V issuing from 
such that the vertices of T are points of Z 2 and the angle between each side of T and 
the horizontal axis is in the interval [0, 7r/2]. Now consider the set X of all pairs (xi,X2) of 
coprime positive integers including the pairs (0, 1) and (1,0). 

We present the following correspondence which is borrowed from Sinai [12]: 

Lemma 1. [12] The space II is in one-to-one correspondence with the space Cq{X) of finite 
nonnegative integer-valued functions v(x) on X (where v(x) is said to be finite if u(x) 7^ 
only for finitely many x ). 
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Proof: An element of IT is denned by the set of its sides since the sides are ordered ac- 
cording to their slope. Now, each side is defined by a vector (2/1,2/2) one can write y.( — , — ) 

y y 

where y := g.c.d.fyi, y 2 ). Now (— , — ) G X and we define v(— , — ) = g.c.d.fyi, y 2 ). 

y y y y 

Conversely, given v £ Co (A), we order the (finite) set of points x such that v(x) > 0, 
according to the increasing values of — , and obtain a finite sequence of points x^\ Now, 

the polygon with consecutive sides v{x^)x^ is convex. □ 

3. A DETAILED COMBINATORIAL ANALYSIS 

Let us introduce now the following parametrized probability distribution on Co (A), which, 
by the correspondence of Lemma 1, corresponds to a probability distribution on IT: 

VveC (X), Q Zl , Z2 , x (u) = J] 1 1 2 \ zTz? 



(xi,x 2 )ex 1 + 



\-z x H x 2 2 



~xi x 2 
~1 ~o 



" 11 (Zl Z2 j A 1 + (A-1K^ 2 

(xi,x 2 )ex v 712 

where zi, 22 £ [0, 1[, and A > 0. 

Taking A = 1, the probability Q ZUZ2j i is nothing but the two-parameter probability dis- 
tribution introduced by Sinai. Under Q Zl ,z 2 ,\, the variables {v{x)) x& x are still independent, 
like in Sinai's framework, but follow a geometric law only for A = 1. 

Loosely speaking, compared to Sinai's probability distribution, Q ZXZ2 \ corresponds to the 
introduction of a penalization of the probability by a factor A each time a vertex appears. 

Now, we define n(ni, n 2 , N) the set of polygons with endpoint (ni, n 2 ) having N vertices, 
and we denote by Af(ni, n 2 , N) its cardinal. 

Since Q zljZ2 ,\(v) depends only on ^2 x\v(x), ^2 x 2 v(x), and ^2 li/(x)^o> we deduce 

xex xex xex 

that the conditional distribution induced by Q Zl:Z2t \ on H(ni,n 2 , N) is uniform, and we can 
write 

Af(n 1 ,n 2 ,N) = zr i z 2 n2 X- N ]J { 1 + {X _\^ Z *A Q^A^m, n 2 , A)) (*) 

In order to get a logarithmic equivalent of M{n\, n 2 , A), our strategy is to choose the 
three parameters so that 

x\\z Xl z X2 

E zliZ2tX (J2 x Mx))= Yl n — ^T^TTTTa u^7^ =Ul (1) 

x Xz Xl z X2 

E Zl , Z2 , x (J2x 2 u(x))= £ (i- W Kl + (A-l)W) = " 2 ^ 

xSX (xi,x 2 )6X v 7 1 2 ; 



i- Z o,a(5Z M*)*o) = Yl 1 + (A - l)z Xl z^ 2 ~ N ^ 



\ r xi x s 
_ 1 2 

xex (xi,x 2 )ex 
This will indeed lead to an estimation of Q ZljZ2: \(H(ni, n 2 , A)) which will be much bigger 
than exponentially small (see lemma 2 below). 

Now, our asymptotic setting is the following: n\ = n 2 = n — > +00, and A/n 2 / 3 — > c. Let 
us note that we could have chosen a more general setting, taking n 2 /n\ converging to an 
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arbitrary constant, but this does not add any difficulty and just make the computations less 
readable, whereas the final result can be guessed from the case n\ = ri2. 

Theorem 1. For all A > 0, 

\n{X(n,n, [c(A)n 2 / 3 ])) = n 2 / 3 (e(A) + o(l)) 
where c(A) and e(A) are positive functions on ]0, +oo[, defined by : 



e(A) 



1-A C(2) 1 /3( C (3)-Li 3 (l-A))2/3 

/ C(3) - Li 3 (l - A) \ 1/3 Aln(A)Li 2 (l - A) 1 

V C(2) J 1-A C(2) 1 /3(C(3)-Li 3 (l-A))V3 



Remark: The function e(A) is maximal for A = 1, and, in this case, 

1 



c(l) 



e(l) 



(C(2)C(3) 2 )V3 
= 3 /W V3 



VC(2), 

This implies the results of [3], [13] and [12] about the typical behavior of a convex chain, in 
a straighter manner than by the original arguments. 

We refer the reader to Figure 1 which represents the graphic of (c(A), e(A)) for A g]0, +oo[. 



Proof: Let us consider parameters Zi, so that z\ = z<i = z with z of the form z = 1 



S 

where the new parameter S is itself depending on n, but is bounded from above and below 
by two strictly positive constants. Equality (1) then writes 



\t,(l 5_\n 1/3 (*i+t2) 

t l/3 ^U 1 n i/-i) 

(tl>ta ^i sX a - a - ^a) ral/3(ii+i2) )(i + (a - i)(i - jj,r /3 ^K 



Now, using Theorem 459 of the book of Hardy and Wright [8] , saying that the set X has 
density in N 2 , we get that the previous sum is equivalent to 



/ 1 f + °° f + °° Atie-^+fr) 

\Cm Jo Jo (1 - e-'(*i+*"))(l + (A - l)e-^+^) dtldh 

Using a linear change of variable, Sj := 5ti , and the identity 

XX 1 1 



x n 



(1 -X)(l + (A- l)X) 1-X 1-(1-X)X 
the previous expression writes 

(cP Jo Jo 1 - e-(«i+*0 " l-(l-A)e-(^) dSldS2 ) X " 

Using the series expansion 1 _ ae _| si+S2) = ^2 k>0 a k e~ k ( Sl+S2 \ we get finally the following 
expressions: 

^C^-E^-^kw -«.(•-*)) 

(The series expansion on the left hand side does make sense only if the value of A is 
restricted to [0, 1] , but the expression in terms of the polylogarithm function Li 3 is well 
defined for all A > 0, and coincide with the computed integral) 
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We therefore get a first condition on the parameters S and A: 

53 - ^y(C(3)-Li 3 (l-A)) (5) 

Of course, in our symmetric setting, the equality (2) yields the same condition. Let us now 
write equality (3): 

AH S)n 1 / 3 (t 1 +t 2 ) 

V — 2EZL! = [ cn 2 / 3 l 

X 1 + (A-D(1-^)" 1/3 ^) 

In the same way as previously, we deduce 



2/3 _ c „2/3 



The left hand side of the equality writes: 



A (1 - \) k 1 2/3 = A Li 2 (l-A) 2/3 

^ C (2)^ A; 2 -^2 C(2) !_ A 



Hence, the second condition on the parameters 5 and A writes: 

A Li 2 (l-A) 
<5 2 C(2) 1 - A « ° 

Now, back to J\f(n±,n2, N), we use (★) and replacing zi and z 2 by (1 — ^573), we get: 
Af(n,n, [cn 2 / 3 ]) = (1 - )-^ x -[cn^] x 



exp 



E A)d - ^r~) - m (1 - a - Jj- 3 r 1/3 ^) 



Q Zl:Z2 , x {U(n,n, [cn 2 / 3 ])) 

This leads to 

M(n,n, [cn 2 / 3 ]) = Q ZuZ2tX (U(n,n, [cn 2 / 3 ]))x 

exp [ n 2/3 |2<5-clnA+-^- f ln(l - (1 - A)e^ ( * 1+ * 2) ) - ln(l - e -* (tl+ta) )dtidt 2 + o(l) ] 
\ V C(2) Jr2 J 

Using again the linear change of variable, Sj := Sti and afterwards the series expansion 

-k8(ti+t 2 ) 

ln(l - ae- 5 (* 1+ * 2 )) = - ^ a fc - , we get 

k>l 

, n, [en 2 / 3 ]) — Q ZljZ2 \(JI(n, n, [cn 2//3 ]))x 

exp (n 2 / 3 ^2(5 - c In A + ^y(C(3) - Li 3 (l - A)) + o(l))) 
Using (5) this can be simplified into: 

M(n,n,[cn 2/3 ]) = Q ZuZ2tX (U(n,n, [cn 2 / 3 ])) x exp (n 2 / 3 (36 - cln A + o(l))) 

We now have to remark that, according to a type of local limit theorem proved few 
lines below, the quantity Q zltZ2t \(Il(n, n, [cn 2 / 3 ])) decreases polynomially. We derive the 
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expression of c(A) from (5) and Barany, and the corresponding value of the exponent e(A) = 
3<5-c(A)ln(A). ' □ 

Let us now come back to the estimation of Q ZltZ2j \(Tl(n, n, [cn 2 / 3 ])) which is governed by 

a kind of local limit theorem for the variables x\v{x), X2v(x), and ^u(x)^o- 

xex xex xex 

Lemma 2. In the framework of the proof of Theorem 1, Q zlZ2 ^\(U(n, n, [cn 2 / 3 ])) = 0(^573) 

Proof: like in the case of the usual local limit theorem for the sum of iid random vari- 
ables (see [7]), the proof is based on Fourier analysis. However, a dominated convergence 
argument, like in the local limit theorem, doesn't work and we have to be finer. 

We use a real test function h on R 3 , whose Fourier transform has a compact support, and 
we are led to the proof of 

E[hQ2 xiv(x) ~n,J2 %2v{x) - n, \v{x)^o} ~ [cn 2/3 ])} = °(^) 



In the sequel, we will use, for short, the notations X±, X2, X3 for the random variables 
^2 X xiis(x), X2v(x), ^{y(x)^o}i an d the overline notation Z to denote the centered 
variable Z — E[Z] associated to a random variable Z. We will also use the notation C to 
denote a positive constant whose value may change along the lines. 

We can now transform the previous expectation into J-^ 3 h(ti,t2,t^)djl- Xi ~x 3 (ti,t2,t-j) 
where ft Xi y 2 y 3 denotes the Fourier transform of the distribution measure u- Xl ^ 2 Xs °f 
the triplet of random variables (X\, X2, X3), which is a 27r-periodic function in its three 
variables, since we deal with integer valued random variables. 

Therefore, we have to deal with J"r ^3 l(ti,t2,t^)dfi,Y 1 y 2 x 3 (^!'^ 2 ' wnere l(ti,t2,t^) = 
Ylki k 2 k-i& + *2 + k2, *3 + ks) (let us remark that this sum is actually finite). 
Now, since all the variables v{x) are independent, 

d Axi,X2,X 3 (*l>*2,*3) = (JJ</> x (*l)*2,*3))d*1^2^3 



where $ x {ti,t 2 ,h) = E[e^ tlXl ^ x)+t2X2u( - x)+t3l ^ o) ]. 
Now, denoting by p x = z Xl ^ 2 , we get 

Var(u(r)) _ Xpx((l + Px)(l + (\-l) P x)-\px) 

( ( ]) ~ (l-p,) 2 (l + (A-l)p,) 2 
Cov{l v[x) ^v{x))= {1 + { ^ 1)px)2 

and 

Var ^ x ^ = (i + (A - l)p,) 2 

From these formulas, we deduce the expression of the covariance matrix T x of the random 
vector (xiv(x),X2v(x),1 v ( x ) 1 lq). Now, after summing the matrices T x , and approximating 
the sums on the set X by integrals, we obtain that the covariance matrix of the vector 
Xi, X2, X3, is equivalent when n goes to infinity to 

n 4/3 A n 4/3 B nD 

I ',, - i n 4 / 3 B n 4 / 3 A nD 
nD nD n 2 / 3 E 
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(one can state indeed a result of the form: for two given integers k, I, and for a continuous 

G ' s k lS l 2 f(e~ 5 ^ + ^)d Sl ds 2 



function /, ^ x\x l 2 f(p x 



x 



Jr2 



2 + k + l 

n s as soon as the integral 



on the right hand side is finite.) 

Our task is now to prove that /[_^]3 K*) d Axi,x 2 ,x 3 (*) ~ f ex P( _ 5^ r n*))df 
which is easily seen, after a linear change of variable, to be equivalent to some constant 



times 



1(0) 



Let I n := [— n 2 / 3 log n,n 2 / 3 logn] 2 x [n x / 3 log n,n 1 / 3 logn]. We partition the domain 
[-7T,7T] 3 into/ n U([-7r,7r] 3 \/ n ). 

On I n , 

X 

+ exp(~(t, ]T r x t)) exp(i I^t) + ln^(t)) - 1 

Since T n ~ r x , the integration of the first term of the rhs contributes only like 0(^573 ). 
Concerning the second term of the rhs, 



exp(-J]<f,r x t) + In 



< 



ei 2 



|Ex<Cr,t)+ln0 x (t)| 



Now, using the fact that, on I n ,\<j) x \ is bounded away from 0, we get, on I n , that 

|ln^(t)-(l-^(*))l < |1-W)| 2 

and 

1 3 Ex(W~) + ln^(t)| < ^ |1 - x (t)\ 2 + ^ E E i\& O^O*)- ^(x),T^^))| 3 ] 

x x 

< C [J2(44 + ^ + 4)f(p x ) + ^(|ti| 3 x 3 + |t 2 | 3 * 3 + \t 3 f)g(p x ) 
\ x x j 

for suitable functions / and g. 

This last expression is therefore dominated by a constant times tfn 2 + t 2 n 2 + t 4 n 2 / 3 + 
|^|3 n 5/3 + |£ 2 |3 n 5/3 + | ta |3 n 2/3 ) w hi c h is, on I n , a o(tfn 4 / 3 + t|n 4 / 3 + t§n 2 / 3 ) = o((f,r„t)). 

Therefore, the integral on I n of | \\ x 0xW ~~ exp(— ^(i, r n i))| contributes like o(-^g). 

Finally, it remains to prove that the integral of | Y\x 4>xtf)\ over [ — 7T ^ 7r } 3 \ In is o(-^g). 

Actually, we will prove that, outside 7 n , for all Yl x <fi x {i)\ < exp(— C(lnn) 2 ). 

If we denote t' := (ti, ti), so that i = (f, £3), a straightforward computation gives 



i_Pz gin 2 t 



(l+(X-l) Px ) 



2 (l+(X-l) Px ) 



4Ap^ gin 2 t^x+t 



r - *h£l _ sin 2 k 

2 ^ (l+(A-l)p,)2 2 



4px 



2 t'.a: 

(i^F olii — 



Hence, 



]J ^ ex P ( -^J^ n (Az) sin 2 + u(p x ) sin 2 - ' X ^ ^ - p x v(p 

x V x 



• 2 *3 
sin — 
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with 

4Vx(2 + (A-2 )AB ) 



(l-fe) 2 (l + (A-l)ft) 2 



(l + (A-l) Px )2 
From the simple inequality it(/9a;) > 2p x v(p x ), we deduce 

| n^(*)l < exp (-CJ2pMPx) (2 sin 2 ^ + 1 sin 2 
X \ X \ i p x z 



■ 2 *3 

sm - 



Now, for M > 2, the quadratic form 2x 2 + Mix + y) 2 — y 2 is positive definite, and one 
can deduce that there exists C > 0, such that, 

Vx,y £ R, 2sin 2 x + Msin 2 (x + y) - sin 2 y > Csin 2 x. 



Let us consider, firstly, the case \t'\ G [^73-, ^173 ]• We first observe that, since the functions 
' x I are smaller than 1 , it is enough to get an upper bound of j~J | <j) x (t) \ , where As 

Xn[Ai.n 2 / 3 ,+oo[ 2 

is a positive real number which satisfies e~ 2SAs < 1/3. This implies that, on the considered 



In n 1 

,2/3 ) „l/3 J > 



domain, — > 3, and that , for It'l £ f- 

[J I <exp ( -Cn 2/3 f u(e- s ^ + ^)sm 2 (n 1 / 3 (t , .s))ds 1 ds 2 ) 

< exp(-C(lnn) 2 ) 

In the case \t'\ > ^73, and 1*3 1 < ^73", the number of points of X such that t'.x belongs 
to the same period of the sinus is less than n 1 / 3 , so that when we estimate the sum on X by 
an integral on [0, +00 [ 2 , the terms sin 2 * ' x ^ tz and sin 2 can be replaced by some positive 
constant times their mean value i.e. \. 



Hence, 



[J I0x(*)l < exp ( -Cn 2 / 3 / u(tr^+ s ^)ds x ds 2 



Xn[A 5 .n 2 /3 i+0O [2 

< exp(-Cn 2 / 3 ) 

The last case, when 1*3 1 > ^73- and \t'\ < ^§75, is treated in the same way as the first 
case, after observing that, for M > 2, 

Vx, t/6l, 2 sin 2 x + M sin 2 (x + y) — sin 2 y > C sin 2 y. 

□ 



4. The cardinal of the longest convex chain 



In this section we are seeking an equivalent of the maximal number of vertices for an 
element of LI n , or equivalently, the maximal number of integral points that an increasing 
convex function satisfying /(0) = and /(n) = n can interpolate. This question and its 
counterpart, concerning the maximal convex lattice polygons incribed in a convex set , was 
recently solved in [6]. 

We present a first proof, using Theorem 1, and afterwards the proof of [6], which is more 
direct. Moreover, we show how the first proof can suggest the second one. Let us mention 
here that the aim of [6] was to get an equivalent of the maximal cardinal of the vertices of 
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a lattice polygon included into a given convex set. The following result was the stepping 
stone of their proof. 



Theorem 2. [6] Let L £ II n . We denote by card(L) the cardinal of the set of vertices of L. 
Then 

maxcard(L) ~ 3 (-) ~ l,40.ra 2/3 
ien n 'n^+oo \7r/ 



Proof 1 (using Theorem 1): we first notice that e(A) vanishes when A goes to infinity. In 
the same time, 

e(A) -Li 2 (l-A) 

1 j ^+-C(2) 1 / 3 (-Li 3 (l-A)) 2 / 3 

Now, we use the equivalent 



to obtain lim^^ +00 c(A) 



x^-oo T(n + 1) 
T(4) 2 / 3 3 



^2x1/3 ^2/3 

t) r(3) 



Since, for large values of A, c(A) becomes arbitrary close to -I73, and the corresponding 



values e(A) remain strictly positive, we get obviously that 

, max Le n n card(L) 3 
hm ml -7- > 



7T- 



ra^+oo 71,2/3 7r2/3 

Now, let £ > 0, and suppose 

limsup max Len card(L) ^ 3(1 + 2 £ ) 



n-t+oo n 2 / 3 7T 2 / 3 

-n\2/3. 



— J ]. 

/ n \ 2/3 

But, for 5 > 0, we observe that by just picking [(1 — 5)3 I — I ] vertices among the vertices 



/ /n\ 2 /3 • 



/ n \ 2/3 

of L, we get a set of convex chains having cardinal [(1 — 5)3 J ] whose cardinal is equal 

- exp ^3^-^j-n 2 / 3 (l + o(l)^ . Choosing 5 small enough to get 



[(1-5)3 



eW)<3?-* 



VT2/3 > ^2/3 

we obtain a contradiction □ 
Proof 2 (direct [6]): Let us denote by \I> the function defined by ^(n) = max^ e n„ card(L). 
This function is increasing, and we shall prove the existence of a sequence a n of integers 

such that: a n -► +00, ^+1 _> 1 a nd *(a n ) ~ 3 (^) 2/3 . 

In order to define a n , we introduce the set X n = X P\ {(x,y) G IR 2 ^ x + y < n}. 

(The introduction of the set X n is also suggested by Proof 1. Indeed, under Q z (x),z(x),\i 
the limiting law of the variables v(x) when A — > 00, is 5\ for x £ X n and 5o f° r # ^ -^n-) 

We define now (a n , a n ) as the sum of the elements of X n . 
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6 

= x: 



Since the set X has an asymptotic density \, we obtain by projection along the line 



V2.a n ~ -= \ 2x 2 dx = V2.^ 

Now, the polygonal line corresponding to the function v n := lx„ G Cq(X) defines an 
element of H an which realizes the maximal number of vertices for an element of U an . In- 
deed, the function u n corresponds to the choice of the card(X n ) first vectors of X, whose 
projections along the line y = x are the smallest, affected with a minimal weight. 

Therefore, ^>(a n ) = card(X n ). 

Now, 

Atv a 6 n 2 3 2 

card(A n ) ~ x — = — x n 

□ 

5. Chains with few vertices 

We turn now to the case of chains having a number of vertices of order n s with s G]0, 2/3 [. 
Proposition 1. For all A > 0, and s e]0, 2/3[, 

\n(J\f(n,n, [cn s ])) = ((2 - 3s)cln(n) + 2c - 3clnc + o(l))n s 

Proof: In this context, we will choose the parameters Zj so that z\ = Z2 = z with z of the 
form z = 1 — j^rj and A = Aofi 3s ~ 2 - Equality (1) then writes 

A ti(i-^r" s(ti+t2) 



n' ^2 ' "' — — . . 



,2s-l 

(1 - (1 - ^)n 1 - s (ti+*2))(l + (A - 1)(1 - _^ rj )n 1 - s (ti+t2)) 



Now, since A tends to 0, the sum on the left hand side is equivalent to 

\mh L { i-e-^V tldt T n 

Using a linear change of variable, Sj := 5tj and the series expansion - — = &x fc , the 

1 — xY ^— ' 



expression of the last integral reduces to ^ . 

As a result the first condition (1) yields Ao ~ 5 3 . 
Let us turn to condition (3): 

(....^i + fA-iKi-^)-'-^' 

In the same way as in the proof of Theorem 1, we deduce 

I r+oo r+cc 6 >(ti +t 2 ) \ 

c?iidt2 x n s ~ cn s 



\mJo Jo l-e-*(*i+fe) 

But the expression of the last integral reduces to Hence, condition (3) yields Ao ~ 5 2 c. 
We get finally 5 ~ c and Ao ~ c 3 . 



cxp 
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Af(n,n, [cn s ]) = (1 - ^)- 2ri A-M x 

E In (l - (1 - A)(l - J^f-^ +t A - In (l - (1 - J^f-^ +t A 

Q Zl)Z2 ,x(U(n,n, [cn s ])) 

This leads to 

N{n, n, [cn s ]) = Q Zl ^ x (U(n, n, [cn s ])) x (X n 3s - 2 )-^ exp (n s (25 + o(l))) 
which gives the result after taking into account both conditions on 5 and Ao, and proving 
Qzi z 2 A(n(n,n, [cn s ])) = 0(— 33-) which is a result analogous to Lemma 2. □ 

Let us observe that we can generalize Proposition 1, in the following way: 

Proposition 2. Let u n be a sequence of positive integers such that u n = 0(71,3) and u n — > 00 , 

jV(n,n,u n ) 



x 



Let us now investigate the limit shape of lines of II(n, n, [cn s ]). 
Let us denote first by P( ninj [ C n s ]) the uniform measure on H(n,n, [cn s ]). 
We introduce the arc of parabola 
C = {(x,y) :y/y = l-y/l-x,0< x < 1} = {(x,y) : y = l(x) : = (1-Vl~x) 2 ,0 < x < 1} 

and its " 5, n" -neighborhood Us jTl , 

U s , n ■= {(x, y)-\y- n.l(x/n)\ < n5} 

The following result answers a question asked by Vershik at the end of [13] (question 3, 
in the concluding remarks). 

Theorem 3. For all S > 0, c > 0, and s g]0,2/3[, 

P(n,n,[cn s }) (-T C Us ;n ) tends to 1 as 

n — > 00 

Proof: Let us choose a subdivision of the interval [0, |], 9^ < 9^ < ... < 9$ such 
that maxjv^ oo (0j v ( _ 1 — 9^) — > 0. Consider Q ZljZ2j \ where zi = z 2 = z with 2 of the form 
z = 1 — and A = Aon 3s ~ 2 , the parameters z and A being chosen according to c and 
s like in the proof of Proposition 1. We introduce the following random variables under 

Qz 1 ,z 2 ,X 



e N = £ jxi+xiv{x). 

xeX:9f <axctan(^)<^ l 

The mathematical expectation of C J,Ar writes 



(ti.*2)e-r: 
ejv <arctan( *2 ) <e^ hl 

and is equivalent when n — ► 00, to 



(1 - (1 - J^)n—(t 1+ t 2)){l + (A _ _ _*_)„i-.( tl+t2)) 



/ 1 r 
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which is itself equivalent, when maxjv->oo(#j+i — Of) — > 0, to 

aN _ aN aN _ aN 

C x - ^ -i ^ x n = C x J +l f ^- x n. 
(cos Of + sin 6>f) 3 cos(0f - f ) 3 

Furthermore, by moment arguments we don't develop since they involve easy but rather long 
computations, one can check that for all positive 77, Q Zl , Z2 ,\(\(,i ,N ~ E[( : '' N ]\ > v) = °(^f)> 
for all fc. 

Therefore, if d(0, w(T)) denotes the distance to the origin of the endpoint of T, we deduce 
the existence of a limit shape for ^Q^pjj x T under Q Zl ,z 2 ,x- 

In the frame based at the intersection of this limit shape with the second diagonal of the 
unit square, with a first axis parallel to the first diagonal and a second axis parallel to the 
second diagonal, the limit shape has the following parametrized description: 

*(0 = Ci.J*^du = tan6 y(0) = C 1 ./ ( f I ^dn = C 1 tan 2 (-f < <f> < f), 
withC 1 = x(f) = ^. 
Hence, y(<j>) = (x(0)) 2 / 'y/2. 

This curve coincides precisely with C, and, Q Zl)Z2y \(T <£_ Us, n ) = o(^), for all k. 
Now, by the same arguments as in Lemma 2, one obtains 

Q Z1 ,Z2,\(^ G ^-(n,n,[Xn s ])) = O(-). 

n 2 

Since the projection of Q Z1 , Z2 ,\ on U^ n>[Xna]) is precisely P{n, n \\n^ we nave S ot for a11 k 



□ 



6. Back to Jarnik's problem 



In [9] , Jarnfk gives an equivalent of the maximum possible number of vertices of a convex 
polygonal line having its vertices in N 2 , a total length smaller than n, and whose segments 

2/3 

make an angle with the x-axis between and j. What he finds is 3- 2 ^1/3 • ^> i n or der to 
be closer to our setting, we ask the segments to make an angle with the x-axis between 

2/3 

and |, Jarmk's equivalent is changed into 3. ^173 (which is twice the above result for n/2). 

In this section, we want to present a detailed combinatorial analysis of this set of lines, 
which leads to Jarnik's result as well as to the asymptotic of the typical cardinal of such 
lines which is the analog of Barany, Sinai and Vershik's result when the constraint concerns 
the total length. 

Let us first describe Jarmk's argument, which is a good application of the correspondence 
described in section 2. It says the following: the function v realizing the maximum can be 
taken among the functions taking their values in {0, 1}. Indeed, by changing the non-zero 
values of a function v into 1, one can obtain a chain with the same number of vertices, but 
with a shorter length. Now, if the number of vertices N is given, the convex chain having 
minimal length, will be defined by the function v which associates 1 to the N points of X 
which are the closest to the origin. Since the set X has an asymptotic density when TV 
is big, this set of points is asymptotically equivalent to the intersection of X with the disc 
of center O having radius R satisfying = N i.e. R = (^-N) 1 / 2 . The total length 
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(-R fi p3 (2zLJ\T)3/2 

of the line is equivalent to L = J Q r x ^.frdr = = — . This yields precisely 

iV = 3.^^1,02L 2 /3. 

Let us notice here, that the limit shape of the line described above is an arc of a circle 
and not an arc of a parabola like in the previous problem. 

Let us turn now to a more precise analysis. We first introduce the following parametrized 
probability distribution on Cq(X): 



f y/ x\+x\ \V(X) \K(x)^0 

WeCo(X), Q z , x (u) = J] 



i — z V x i+ x 2 



(x lX ^x " l + (A-l)zV^I 

where z G [0, 1], and A > 0. 

Now, we define IT(n, N) the set of the polygonal lines of II having N vertices and a total 
length between n and n + 1, and we denote by J\f(n, N) its cardinal. 



Since Q Zj \(v) depends only on y x\ + x\v(x) and ljy(rr)^o> we deduce that the 

x&X x&X 

conditional distribution induced by Q z ,\ on Tl(n, N) is almost uniform. More precisely, its 
density with respect to the uniform measure is bounded between z and 1. So that, we can 
write 

M(n,N) = c(z)z- n X- N J] ( 1 " ZV ^%^ ) ^A(n(n,AT)) (*) 



(xi,x2)ex 



l + (X-l)z^ x 'i +x '\ 



with c(z) € [1, -j]. In order to get a logarithmic equivalent of J\f(n,N), our strategy is to 
choose the parameters z and A according to 



rex (.,,«)« (1 " + (A - l)zV«,+«2) 

^(Ew)- E — ^frg = w < 8 > 

xex (xi,x 2 )eA- I + (A - IJzv i 2 

Again, this will indeed lead to an estimation of Q Z: \(Il(n, N)) which will be much bigger 
than exponentially small. 

Now, our asymptotic setting is the following: n — > +00, and N/n 2 ^ — > c. Similar 
computations as in section 3 yield: 

Theorem 4. For a// A > 0, 

ln(AA(n, [cj(A)n 2 / 3 ])) = n 2 / 3 (ej(A) + o(l)) 
where cj(A) and ej(A) are positive functions on ]0, +00 [, defined by : 

vri/3 ^1/3 
cj(A) = — c(A), ej(A) = — e(A) 

where c(A) and e(A) are the functions introduced in Theorem 1. 
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Corollary 1. Let 5 > 0. We denote by n n ,n+<5 the set of the polygonal lines of'U having a 
total length between n and n + 5, and, by T ritn+ $, the typical number of vertices of an element 
ofH n ,n+5- We have got the three following asymptotics: 

card(n„ iri+5 ) = exp ^^^H n 2 / 3 (l + o(l))j . 

1/3 



1 , r ~ ( ? ^ n 2/3 _ q 55 2/3 

n^so ^4^(3)2 y " - U '™ • 

max card(L) ~ f ) n 2/3 ~ 1, 02n 2/3 . 



Remark: The third asymptotic is a slight refinement of Jarmk's result, since the condition 
on the total length to belong to [n, n + 5] is more restrictive than in Jarmk's case. 

7. Mixing constraints and finding new limit shapes 

In this section, we consider the set H n ,L of convex polygonal lines of H n whose total 
length belongs to [Ln,Ln+ 1], for L e]V%2[. We introduce P n ,L, the uniform distribution 
on H n ,L, and we are investigating the limit shape of a line under P n ^. The shapes 
corresponding to the two extreme values of L, y/2 and 2 are respectively the diagonal and 
the two sides of the square. We want now to explicit a family of convex curves (Cl) Le]V2 2[ 
interpolating these extreme situations (see Figure 2). 

• For L G [f , 2[, there exists a unique a G [— +oo[ such that 

(a+cos it) 3 *^ 

f 4 COS M ^ 

JO (a+cos n) 3 

We define the convex curve Cl of length L, joining O to (1, 1), and symmetric with respect 
to the second diagonal of the unit square by the following parametrized description: 

cosu j r<t> s'mu j 

xlW) = v2 ^ * , y L {(p) = V2 t * (0 < <p < -). 

f 2 COSV_^ u f-2 COSM J u 1 

JO (a+cosn) 3 JO (o+cos u) 3 

When L tends to 2, a tends to — and the curve Cl converges to the two sides of the 
square. 

When L = 1 + '^"^Z^ ~ 1, 623, a = and the limit shape is the arc of parabola found 
in [3], [12], [13]. 

When L tends to | + , a goes to infinity, and Cl converges to the arc of the unit circle 
joining O to (1, 1). 

• For L e]\/2, §], there exists a unique [3 G [1, +oo[ such that 

I 1 4 COSM 7 

Jo (/3-cos«)i« U 
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We define the convex curve Cl of length L, joining O to (1, 1), and symmetric with respect 
to the second diagonal of the unit square by the following parametrized description: 



. .. ^Jo (0-cos(u-l))* Ua 



x L {(f))=V2 ^ s ) yL (0) = V2 ^ s (0 <</><-). 

f2 COSM . p COSM J 2 

JO (/3-cosM) tl " U ' JO (/3-cosu) 3 "" 

When L tends to \/2, /? tends to 1, and the curve C/, converges to the first diagonal of 
the square. 

When L tends to §~, [3 goes to infinity, and Cl converges to the arc of the unit circle 
joining O to (1, 1). 

We can now define the "n, <5" -neighbourhood Vi, n ,s of Cl by 

V L , n , S = {(x,y):d((^,^),C L )<6} 
and state the limit following limit shape result: 

Theorem 5. For all 5 > 0, L g]a/2,2[ ; P„ iL (T C V L , n ,s) tends to 1 as n — > oo 

Proof: We introduce the following parametrized probability distribution Q y z on Cq{X) 
which takes into account both the total length and the endpoint of V: for v G Cq(X), we 
define 

Q yz (v)= Y[ (y^ 1 ^ 2 ^ zV /x l+ x 1y( x )(i _ (yixi+xz) z V x l +x 2)) 

(xi,X2)£X 

where y and z satisfy: for all (xi,X2) in X, y( x i+ x 2) z v / ^i+^I g [0, 1[. (The introduction of 
an extra parameter A in the definition of the probability, that would " count" the vertices is 
useless in a question of limit shape) 

It turns out here that the computation of the parameters according to the conditions 
that the expectations of the endpoint and total length of the chain are respectively equal 
to (n, n) and Ln, are very heavy in this framework. This implies heavy formulas in the 
combinatorics of these lines. 

However, we know that y and z will be of the form 1 H — and 1 H — 373. 



C^= £ ^x\ + x\.v{x). 

xdX : 6»j v <arctan(^) <6 )jV i 

The mathematical expectation of writes 



n 



1/3 



E 



(i - (i + ^3)" 1/3 ( ti+t2 Hi + 1^3 r 173 ^ 1 ) 



0¥ <arctan(i2.)<6l-^ . 
3 yt l' 3 + 1 

and is equivalent when n — ► 00, to 

v /^rq^! e 7(ti+t2)+5 v / ^! 



ill (II 2 x n 

C(2) Jof<e<ef +1 1 - e 7(*i+ta)+ty*i+^ y 



which is itself equivalent, when maxAr_ >00 (6'j v ) _ 1 — 9^) — > 0, to 



#N ^ _ /;.V />.Y 

( 7 (cos#f^sin#f) + 5) 3 ~~ ~ ( T cos(6»f - f ) + <5) 3 



fx- i±i -rxn Cx J+l J 



x n. 
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The concentration of the normalized random chain around a limit shape Cl, which is sym- 
metric with respect to the second diagonal, follows like in the proof of Theorem 3. We derive 
the announced parametrization of Cl from the above estimation of E[^ ,N ]. 

When L > |, we have to take 7 > 0, the variable a equals ^, and the condition on a 
comes from the condition we had to impose on y and z. We proceed similarly for L < \. □ 

8. Some remarks when the "lattice" is random 

Let us replace now the (n + l) 2 points of [0, n] 2 n Z 2 , by a set of n 2 points independently 
and uniformly distributed in [0, n] 2 . It was observed in [5] that the probability for k points 
independently and uniformly distributed in the square [0, l] 2 to form, with (0, 0) and (1, 1), 

a convex chain is exactly equal to — — . From this result, we can derive an analog of 

kl(k + 1)! 

Theorems 1 and 3, for the expected number of chains. Indeed, one can easily deduce that 
for s £ [0, 1] and c > 0, the expectation of the number of chains, having a cardinal equal to 
[cn s ], is equal to 

,2 



n 



\cn a 



n 



c(2-3sK eX p((3c - 3c In c)n s + o(n s )) (9) 



([cn s ])\([cn s ] + 1)! 

which is maximal for c = 1 and s = 2/3, and, in this case, the expected number of convex 
chains of cardinal [n 2//3 ] is equal to exp(3n 2 / 3 + o(n 2 / 3 )). It is straightforward to see that the 
expected number of the total number of convex chains is also equal to exp(3n 2 / 3 + o(n 2 / 3 )), 
and a moment method show that, in probability, the cardinal of a typical convex chain is 
equivalent to n 2 / 3 . Let us remark finally that, in the case of chains having cn s vertices, with 
s < 2/3, the main term n c ( 2_3s ) nS is common to both Theorem 3 and formula (9). 

Now, the question of the largest cardinal of a convex chain is delicate in the random case. 
We believe but we are not able to prove that there is an equivalent, in probability, of this 
quantity when n goes to infinity. Indeed, Kingman's sub-additive ergodic theorem is of no 
direct help here, since the concatenation of two convex chains is not necessarily convex. 
Nevertheless, from (9), one can say, that, with an asymptotically overwhelming probability, 
this cardinal is less than e.n 2//3 . 

One can even slightly ameliorate this result in the following manner: suppose there exists 
d e]0, e[, such that with an asymptotically positive probability, there will be a chain having 
length [dn 2 / 3 ]. When such a chain exists, since every sub-chain of a convex chain is also 
a convex chain, we get that, for all c g]0, d[, there are, with an asymptotically positive 

probability, at least ('^2/3]) convex chains having cardinal [cn 2 / 3 ]. Therefore, the expected 

number of such chains is at least exp (^({d — c) In -^- c + cln |)n + o(nfj . Now, using (9), the 
real d must satisfy that, for all c g]0, d[, 

{(d- c)\n-^— +cln^) <3c(l-lnc) 
Some numerical computations yield d < 2, 65. 

We deduce that, in the context of n random uniformly distributed points, the longest 
convex chain, is with an asymptotically overwhelming probability, smaller than 2,65.n 2//3 , 
which is, as we intuitively expected, bigger than the l,40.n 2 / 3 found in Theorem 2. Note 
that the lower bound obtained by just considering the extremal points of the convex hull of 
the points of the Poisson process which are situated above the parabola, which is, by a result 
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of Renyi and Sulanke [10] and their followers, equivalent to ^1/3 r(5/3)n 2 / 3 
does not give a better lower bound than the typical behavior. 
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0.2 0.4 0.6 



1.2 1.4 



Fig. 1: The graph of (c(A),e(A)). 
The point of maximal e-coordinate corresponds to typical chains [3] [12] [13]. 
The point of maximal c-coordinate corresponds to chains of maximal cardinal [6]. 



0.8 



0.6 



. 4 



0.2 




Fig. 2: Limit shapes of different lengths. Successively: 
L = V2 (diagonal), L = 1,454, L = 1,516, L = f (circle), L = 2 2/3 (parabola), 
L = 1, 716, L = 1, 861, and L = 2 (sides of the square). 
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